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Summary. We present results of a survey of public transport networks (PTNs) 
of selected 14 major cities of the world with PTN sizes ranging between 2000 and 
46000 stations and develop an evolutionary model of these networks. The structure 
of these PTNs is revealed in terms of a set of neighbourhood relations both for the 
routes and the stations. The networks defined in this way display distinguishing 
properties due to the constraints of the embedding 2D geographical space and the 
structure of the cities. In addition to the standard characteristics of complex net- 
works like the number of nearest neighbours, mean path length, and clustering we 
observe features specific to PTNs. While other networks with real-world links like 
cables or neurons embedded in two or three dimensions often show similar behavior, 
these can be studied in detail in our present case. Geographical data for the routes 
reveal surprising self-avoiding walk properties that we relate to the optimization of 
surface coverage. We propose and simulate an evolutionary growth model based on 
effectively interacting self-avoiding walks that reproduces the key features of PTN. 



1 Introduction 

Urban public transport networks (PTNs) share general features of other transporta- 
tion networks like airport, railroad networks, power grids, etc. [1]. The most obvious 
common features are their embedding into a 2D space, evolutionary growth, and op- 
timization. The evolution of specific local transportation networks is closely related 
to that of the town or region in which they are embedded. However, statistically, 
some general overall features such as their fractal dimensions have been observed [2]. 
Modeling different aspects of transportation network structure and functioning helps 
to understand and optimize various processes that occur on these networks as well 
as to improve their planning. In this paper, we will consider a PTN from the point 
of view of complex network theory [1, 3-5] quantifying statistical features of their 
structure and proposing a model that reproduces their key features. To exemplify 
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these, we survey the empirical analysis of PTNs of 14 major cities of the world (see 
Refs. [13, 16] for more details). While the empirical analysis of PTNs of different 
cities has been subject of several studies [6-13, 15] (see Sec. 2), we are not aware of 
previous attempts to specifically model the network evolution of a PTN in the way 
we propose here. The set-up of the paper is as follows: in the next section we discuss 
common features of PTNs and list some numbers that quantify these features; sec- 
tion 3 we describe our model for PTN evolution and give supporting arguments; in 
section 4 we compare some characteristics of real and simulated PTNs; conclusions 
and an outlook are given in section 5. 



2 Empirical analysis of PTNs: statistical properties 

A distinct feature of our study is that we interpret the PTN as a network of all 
means of public transport (buses, trams, subway, etc.) offered in a given city. Several 
previous studies have analyzed specific sub-networks of PTNs. The Boston [6-9] an 
Vienna [9] subway networks may serve as examples. However, each particular sub- 
network (e.g. the network of buses, trams, or subways) is not a closed system: it 
is a subgraph of a wider transportation system of a city, or as we call it here, 
of a PTN. Therefore to understand and describe the properties of transport in a 
city it is important to deal with the complete PTN, not restricting the analysis 
to specific parts. Indeed, extending the restricted subway network to the "subway 
-I- bus" network drastically changes the network properties, as it was shown for 
Boston [7,8]. 

Another important quantity that certainly restricts the reliability of grneraliza- 
tions drawn from trends observed for specific PTNs of difi^erent cities is the statistics, 
i.e. the size of the observed PTNs. The numbers of stations A'^ in the PTNs analyzed 
so far ranged from several decades (subway networks of Boston, N = 124 [8], and 
Vienna, A = 76 [9]) to several thousands as in the PTN analysis of 22 Polish cities 
with up to 2811 stations [11] or bus-transport networks of three Chinese cities with 
up to A = 3938 stations. In our sampling [10, 13] we have chosen PTNs of 14 major 
cities of the world of various size with A ranging from 1544 (Diisseldorf) to 46244 
(Los Angeles), see Table 1. 




Fig. 1. Different representations of a PTN. L'-space: nodes correspond to stations, 
links show different routes. L-space: the same as L' but multiple links are reduced to 
single ones. P-space: any two stations (nodes) are connected when they are serviced 
by a common route. B-space: routes (open discs) are linked to those stations (filled 
discs) that they service. C-space: nodes correspond to routes, links show that two 
routes share a common station. Note that by keeping multiple links in P and C one 
may also define P' and C'-spaces. 
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Here, we define different graph representations ('spaces') for a given PTN in 
terms of nodes (vertices) and links (edges) as illustrated in Fig. 1. The primary 
network topology is defined by a set of routes each servicing an ordered series of 
given stations, see the sketch labeled L' in Fig. 1 Based on this we define a graph 
(L-space) representing each station by a node and linking any two that are served 
consecutively by at least one route. More generally linking any two stations serviced 
by a common route wo define the P-spacc [11, 17] representation. The interrelation 
of the routes in turn is described by a complementary C-space representation where 
now the nodes represent routes and linking any two that service a common station. 
Prom Fig. 1 it is easy to verify, that the last two representations correspond to 
one-mode projections of a bipartite B-space represenation with nodes of two types, 
representing either a station or a route linking each route with all stations it services. 

Table 1. PTN characteristics in L- and P- representations, see text for definitions. 
N: number of stations, M: number of routes, the scale k or the exponent 7 of fits 
to the node degree distribution p{k) to the laws (1) or (2) respectively, C: ratio of 
the mean clustering coefficient to its random graph value, maximal i, and mean i 
shortest path length. Characteristics in P-space representations are indicated by a 
subscript 'p'. See Refs. [13,16] for more data and sources. 
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In our analysis, we are interested in different features of the PTN as measured 
when represented in the above defined spaces. It is worth to mention here, that these 
standard network characteristics measured in different spaces turn out to be specific 
of practical value in judging about the quality of public transport in a given city. 
The particular quantities wo analyze here are the maximal and mean shortest path 
length £ and J, the clustering coefficient C, and the betweenness Cs; for definitions 
see Ref. [14] in this volume. Table 1 gives some of these quantities for the cities 
analyzed in L- and P- representations. One can see that the above networks are 
highly clustered small worlds characterized by small shortest path lengths and large 
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ratios C of the mean clustering coefficient relative to its value Cer = 2M/iV^ on a 
random graph with the same numbers of nodes A'^ and links M. 

To classify the node degree distributions p{k) we performed fits to both an 
exponential function 

p{k) = Ae-'/\ (1) 

as well as to a power law: 

p{k) = Bk~^. (2) 
The result of the better fit together with the value of the fit parameters fc or 7 are 
shown in table 1. One can see that the L-space node degree distribution of a part of 
these PTNs (8 out of 14) is governed by power laws, indicating scale-free properties. 
For the PTNs of several cities, this fact has also been described in Refs. [10,11]. The 
remarkable feature of the data shown in table 1 is that some PTNs (3 out of 14) show 
scale-free behavior even in P-space. As examples compare the degree distributions 
p{k) of the PTNs of Paris and Sydney in Fig. 2. WhereEis the Paris PTN is scale-free 
both in L- and P-space, the PTN of Sydney is scale-free in L-space only. Note that 
to reduce the noise in the data we plot the P-space cumulative degree distribution 

00 

P{k) = ^p{q). (3) 

q=k 
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Fig. 2. Node degree distributions for Paris (first row) and Sydney (second row) 
PTNs. Left column: degree distributions p{k) in L-space. Right column: cumulative 
degree distributions (3) in P-space. 

In Fig. 3 plot the mean bctwccnncss value Csik) (sec [14] in this volume) as 
function of the node degree k in in C-, L-, P-, and B- spaces (similar behaviour is 
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found for the networks of other cities) . One definitely sees a pronounced correlation 
and a tendency for power-law behaviour: a phenomenon observed previously for 
several other networks [11, 18, 19]. 
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Fig. 3. Betweenness-degree correlations for Paris PTN in C-, L-, P-, and B-spaces. 



In addition to the standard characteristics of complex networks discussed above, 
one can introduce some characteristics which arc specific to PTNs. One of them is 
due to the fact that very often several routes go in parallel and pass L consecutive 
stations. In particular, the notion of harness was introduced in Ref. [13] to quantify 
this behaviour. In Fig. 4 we show the harness distribution P[L, R): the number of 
sequences of L consecutive stations that are serviced by R parallel routes for the 
PTNs of Paris (a) and of Sydney (b). The log-log plot indicates scale free properties 
of this distribution. We have found similar behaviour for the majority of the cities 
under consideration. 

Following this analysis of PTNs of different cities of the world and having at 
hand the numerical data on different features of these networks, let us propose a 
model, that may reproduce most of these features. In particular, the model should 
be capable to discriminate between different types of behaviour observed so far in 
PTNs (an example may be the exponential and power-law node degree distributions 
for p{k) observed for different cities). 



3 Evolutionary Model of PTNs 



To model the properties of PTNs we have proposed [13] an evolutionary growth 
model for these networks along the following lines: We model the grid of streets by a 
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Fig. 4. Harness distribution for Paris (a), for Sydney (b), and for a simulated city 
(c). 



square lattice and allow every lattice site r (street corner) to be a potential station 
visited by say k{r) routes. The routes are modeled as self-avoiding walks (SAWs) 
on this lattice. The rules of our model are the following: 

1. First route: construct a SAW of length n starting at an arbitrary site. 

2. Subsequent routes: 

- (i) choose a terminal station on lattice site r with probability q oc k{r) + a; 

- (ii) choose a subsequent station of this route at a neighboring site r' with 
probability q oc k{r') + b; 

- (iii) repeat step (ii) until the walk has reached n stations, in case of self- 
intersection discard the walk and restart with step (i). 

3. Repeat step 2 until M routes are created. 

The above rules resemble the preferential attachment growth rule [20] favoring 
high-degree nodes when linking new nodes to a network. The principal difference of 
our algorithm is that at each step (2ii) we link an existing station to a neighboring 
site which does not need to be empty. New stations are then only added at the 
frontier of the PTN cluster while high degree nodes (hubs) accumulate at its center. 
The choice of a SAWs to model the routes may seem odd at first sight, however, the 
fractal dimensions measured in PTNs [2, 13] are compatible with 2D SAW behaviour 
for the routes and it is obvious that a single PTN route very seldom intersects itself. 
Moreover, the scaling properties of SAWs on disordered lattices do not change, 
provided the disorder is short-range correlated [21]. In our application this means, 
that even the presence of certain geographical constraints and deviations from the 
square lattice still allow for a SAW description of a PTN route. 

The above described generating procedure results in a simulated PTN of M 
routes, each consisting of n stations. In Fig. 5 we show several typical PTN config- 
urations of simulated cities on a 300 x 300 square lattice with M = 500, n = 50 and 
different values of a and b. The parameters a and b allow to discriminate between 
different regimes of the network evolution. Setting a = limits every new route 
to start from an already existing station. On the contrary, for a 7^ 0, the terminal 
station of a new route may be situated at any lattice site. Therefore, increasing a 
allows for PTNs that consist of more than one component (c.f. Figs. 5a and 5b). 
The parameter b on the other hand tunes the evolution of the routes: a small value 
of 6 forces the routes to propagate in parallel (harnessed) while increasing 6 results 
in routes that cover more sites of the lattice (see Figs. 5c, 5d correspondingly). In 
the next section we will investigate how the numerical characteristics of the modeled 
PTNs correspond to those observed for real cities. 
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Fig. 5. Different simulated PTN maps of 'cities' of size 300 x 300 with M = 500 
routes of n = 50 stations each, a): a = 0.0001,6 = 1, b): a = 0.001,6 = 1, c): 
a = 0,6 = 0.1, d): a = 0,6 = 0.5. 



4 Simulation Results 



Postponing a more detailed analysis of the numerical simulations based on the model 
of section 3 to a separate publication [16] , we focus here on several principal features 
of PTNs and deomstrate how they are reproduced by our model. One of them is the 
behaviour of the node degree distribution p{k). As it was shown in section 2, the 

behaviour of this function varies for PTNs of different cities. In L-spacc one often 
observes p{k) to be of a power-law type (2) [10-13], however sometimes it is governed 
by an exponential decay (1). Moreover, recently these two types of behaviour were 
also observed in P-space [13] . A distinguished feature of our model is that depending 
on the values of the evolution parameters a, b it discriminates between a power-law 
and an exponential p{k) in P-space. Note that on the square lattice the L-space 
degrees are limited to A: < 4 ruling out such an analysis. As an example, in Fig. 6 
we show the cumulative node degree distribution P{k), see Eq. 3, for PTNs of two 
simulated cities in the P-spacc. Changing the value of parameter 6 for fixed a one 
passes from an exponential (a straight line in the log-linear plot in Fig. 6a) to a 
power- law regime (a straight line in the log- log plot in Fig. 6b). 

Another specific feature of real world PTNs that is nicely reproduced by our 
model is the harnessing effect. In Fig. 3c we show the harness distribution P{L, R) 
for the PTN of a simulated city on 300 x 300 sites with M = 500 and n = 50 at 
a = 0, 6 = 0.5 in comparison with the same quantity for the PTNs of Paris (Fig. 3a) 
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a) b) 

Fig. 6. P-spacc cumulative degree distribution P{k) for diflerent simulated cities of 
300 X 300 sites with M = 500 and n = 50. a) a = 0, 6 = 0.5: exponential in P-space, 
b) a = 0, 6 = 0.1: power law in P-space. 

and of Sydney (Fig. 3b) . Again, as in the case of real- world PTNs one may speculate 
about power-law behaviour. Note however, that neither for the node degree nor for 
the harness distribution we so far find a simple relation between the exponents 
that may govern the scaling and the model evolution parameters a, b. Finally, let us 
compare the betweenness- degree correlation. In the same way as for the PTN of Paris 
(Fig. 3), we plot this function in Fig. 7 for different representations for a simulated 
city of 300 X 300 sites with M = 500 and n = 50 for the evolution parameters 
o = 0, 6 = 0.5. One can see an overall qualitative agreement between the behaviour 
observed for the real-world network and the simulated one (for L see discussion 
above) . In this context it is worth to mention that the simultaneous use of different 
representations (spaces) serves as a useful tool to quantify the correspondence and 
differences between real word networks as well as simulated ones. 



5 Conclusions and outlook 

The small-world properties of public transport networks are an everyday experience: 
it is easy to reach almost any given place in a city with only a small number of 
changes of transport (from table 1 it follows e.g. that the connection between any two 
stations in Paris implies on average ip — 1 = 1.8 changes), the scale-free properties 
of these networks arc not that evident. Even more, scale free properties for these 
networks have sometimes been doubted. The results of our empirical analysis of 
PTNs of 14 major cities of the world [13,15,16] together with the empirical data for 
several other cities [6-12] however give a strong evidence of the fact that scale-free 
behaviour emerges in many PTNs. Apart from our model we currently cannot give 
more general arguments for this this behaviour. Furthermore, not all PTNs seem to 
exhibit power-law node degree distributions, as some display rather an exponential 
decay of p{k), see Table 1. Inspired by this observation we developed an evolutionary 
model of self-avoiding walks on 2D lattice which discriminates between the above 
types of behaviour and recovers a number of other basic features of the PTNs. 
The model applies the idea of the preferential attachment scenario [20], however 
with specific differences to that standard scenario: as far as the PTNs constitute 
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Fig. 7. Betweenness-degrce correlation for the simulated city of 300 x 300 sites with 
M = 500 and n = 50 in C-, L-, P-, and B-spaces. a = Q,b = 0.5. 



an example of ever evolving networks, such a mechanism is not unlikely to play a 
role in their growth. In more general terms, scale-free networks have been shown 
in certain situation to minimize both the effort for communication and the cost 

for maintaining connections [22,23]. Similar optimization was shown to lead to the 
small world properties [24] and used to explain the appearance of power laws [25]. 
Therefore one may expect the observed scale-free behaviour of PTNs to naturally 
emerge from obvious optimization objectives followed in their design. 

One of the specific features of the PTNs wo have analyzed is the harnessing 
effect: very often several routes go in parallel and pass together several consecutive 
stations. While other networks with real-world links like cables or neurons embed- 
ded in two or three dimensions often show similar behavior, these can be studied in 
detail in our present case. Our empirical analysis of the harness distribution that 
quantifies this behavior indicates power-law behaviour. The same behaviour is in- 
herently recovered by our model. We found that a useful tool to classify PTNs as 
well as to find correspondence between real-world and simulated PTNs is a com- 
parison of the observables in different representations ('spaces'). Furthermore, the 
standard network characteristics as represented in different spaces turn out to be 
natural measures for the quality of public transport in a city. 

Of the many interesting further questions that are related to this study let us 
only mention the vulnerability of PTNs to random failures and targeted attacks 
(see our contribution [14] on this subject in this volume) as well as the correlation 
between the topological properties of a PTN and its geographical embedding. 
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